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SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Hillman, Jennifer L. 

6 Lai, Preeti 

7 Corley, Neil C. 

8 Shah, Purvi 

11 (ii) TITLE OF INVENTION: HUMAN CITRATE SYNTHASE HOMOLOG 

13 (iii) NUMBER OF SEQUENCES: 3 

15 (iv) CORRESPONDENCE ADDRESS: 

16 (A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

17 (B) STREET: 3174 Porter Drive 

18 (C) CITY: Palo Alto 

19 (D) STATE: CA 

20 (E) COUNTRY: USA 

21 (F) ZIP: 94304 

23 (v) COMPUTER READABLE FORM: 

24 (A) MEDIUM TYPE: Diskette 

25 (B) COMPUTER: IBM Compatible 

26 (C) OPERATING SYSTEM: DOS 

27 (D) SOFTWARE: FastSEQ for Windows Version 2 
29 (vi) CURRENT APPLICATION DATA: 

C--> 30 (A) APPLICATION NUMBER: US/09/996,223 

C--> 31 (B) FILING DATE: 27-Nov-2001 

32 (C) CLASSIFICATION: 

34 (vii) PRIOR APPLICATION DATA: 

35 (A) APPLICATION NUMBER: 08/970,134 

36 (B) FILING DATE: 

38 (viii) ATTORNEY/AGENT INFORMATION: 

39 (A) NAME: Billings, Lucy J. 

40 (B) REGISTRATION NUMBER: 36,749 

41 (C) REFERENCE/DOCKET NUMBER: PF-0425 US 
4 3 (ix) TELECOMMUNICATION INFORMATION: 

44 (A) TELEPHONE: 650-855-0555 

45 (B) TELEFAX: 650-845-4166 

46 (C) TELEX: 
49 (2) INFORMATION FOR SEQ ID NO : 1: 

51 (i) SEQUENCE CHARACTERISTICS: 

52 (A) LENGTH: 466 amino acids 

53 (B) TYPE: amino acid 

54 (C) STRANDEDNESS : single 

55 (D) TOPOLOGY: linear 

57 (vii) IMMEDIATE SOURCE: 

58 (A) LIBRARY: ENDCNOT03 
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SEQUENCE DESCRIPTION: SEQ ID NO: 
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290 








295 








300 









101 Asn Gin Glu Val Leu Val Trp Leu Thr Gin Leu Gin Lys Glu Val Gly 

102 305 310 315 320 

103 Lys Asp Val Ser Asp Glu Lys Leu Arg Asp Tyr He Trp Asn Thr Leu 

104 325 330 335 

105 Asn Ser Gly Arg Val Val Pro Gly Tyr Gly His Ala Val Leu Arg Lys 

106 340 345 350 

107 Thr Asp Pro Arg Tyr Thr Cys Gin Arg Glu Phe Ala Leu Lys His Leu 

108 355 360 365 

109 Pro Asn Asp Pro Met Phe Lys Leu Val Ala Gin Leu Tyr Lys He Val 
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Arg 


Ala 
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Arg 
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Ser 


Gly 
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465 

































124 (2) INFORMATION FOR SEQ ID NO : 2 : 

126 (i) SEQUENCE CHARACTERISTICS: 

127 (A) LENGTH: 2131 base pairs 

128 (B) TYPE: nucleic acid 

129 (C) STRANDEDNESS : single 

130 (D) TOPOLOGY: linear 

132 (vii) IMMEDIATE SOURCE: 

133 (A) LIBRARY: ENDCNOT03 

134 (B) CLONE: 2171653 

136 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

138 CGCCGGTTCG TCTACTCTTT CCTTCAGCCG CCTCCTTTCA ACCTTGTCAA 

139 CGGCCTCTGG TGCAGCGGCG GCGGCTCCTG TTCCTGCCGC AGCTCTCTCC 
14 0 TCCCCACCAG ■ ATCCCGGAGA TCGCCCGCCA TGGCTTTACT TACTGCGGCC 

141 TGGGAACCAA GAATGCATCT TGTCTTGTTC TTGCAGCCCG GCATGCCAGT 

142 CGAATTTGAA AGACATATTG GCTGACCTGA TACCTAAGGA GCAGGCCAGA 
14 3 TCAGGCAGCA ACATGGCAAG ACGGTGGTGG GCCAAATCAC TGTGGACATG 
144 GCATGAGAGG CATGAAGGGA TTGGTCTATG AAACATCAGT TCTTGATCCT 
14 5 TCCGTTTCCG AGGCTTTAGT ATCCCTGAAT GCCAGAAACT GCTACCCAAG 
14 6 GGGAAGAACC CCTGCCTGAG GGCTTATTTT GGCTGCTGGT AACTGGACAT 
14 7 AGGAACAGGT ATCTTGGCTC TCAAAAGAGT GGGCAAAGAG GGCAGCTCTG 
14 8 TGGTCACCAT GCTGGACAAC TTTCCCACCA ATCTACACCC CATGTCTCAG 
14 9 CTGTTACAGC CCTCAACAGT GAAAGTAACT TTGCCCGAGC ATATGCACAG 

150 GAACCAAGTA CTGGGAGTTG ATTTATGAAG ACTCTATGGA TCTAATCGCA 

151 GTGTTGCAGC AAAGATCTAC CGAAATCTCT ACAGAGAAGG CAGCGGTATT 

152 ACTCTAACCT GGACTGGTCT CACAATTTCA CCAACATGTT AGGCTATACT 

153 TCACTGAGCT CACGCGCCTG TACCTCACCA TCCACAGTGA CCATGAGGGT 

154 GTGCCCATAC CAGCCATTTG GTGGGCAGTG CCCTTTCCGA CCCTTACCTG 

155 CAGCCATGAA CGGGCTGGCA GGGCCTCTCC ATGGACTGGC AAATCAGGAA 

156 GGCTAACACA GCTGCAGAAG GAAGTTGGCA AAGATGTGTC AGATGAGAAG 

157 ACATCTGGAA CACACTCAAC TCAGGACGGG TTGTTCCAGG CTATGGCCAT 

158 GGAAGACTGA TCCGCGATAT ACCTGTCAGC GAGAGTTTGC TCTGAAACAC 

159 ACCCCATGTT TAAGTTGGTT GCTCAGCTGT ACAAGATTGT GCCCAATGTC 

160 AGGGTAAAGC CAAGAATCCT TGGCCCAATG TAGATGCTCA CAGTGGGGTG 

161 ATTATGGCAT GACGGAGATG AATTACTACA CGGTCCTGTT TGGGGTGTCA 

162 GTGTACTGGC ACAGCTCATC TGGAGCCGAG CCTTAGGCTT CCCTCTAGAA 
16 3 CCATGAGCAC AGAGGGTCTG ATGAAGTTTG TGGACTCTAA GTCAGGGTAA 



CCCGTCGGCG 60 

CTTTCTTACC 120 

GCCCGGCTCT 180 

GCTTCCTCCA 24 0 

ATTAAGACTT 300 

ATGTATGGTG 360 

GATGAGGGCA 420 

GCTAAGGGTG 4 80 

ATCCCAACAG 540 

CCTTCCCATG 600 

CTCAGTGCAG 660 

GGTATCAGCC 720 

AAGCTACCTT 780 

GGGGCCATTG 840 

GATCATCAGT 900 

GGCAATGTAA 960 

TCCTTTGCAG 1020 

GTGCTTGTCT 1080 

TTACGAGACT 1140 

GCAGTACTAA 1200 

CTGCCTAATG 1260 

CTCTTAGAGC 1320 

CTGCTCCAGT 1380 

CGAGCATTGG 144 0 

AGGCCCAAGT 1500 

AACTGGAGAC 156 0 
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164 TGGGTGAAAG TGACTACCAG AAAGTGAGGA AGCCTAAATA AAAAGTATAC TTTTGTTTCA 1620 

165 GGGGGCCTTT AAAGACTTAA GATTAAATTA TATCTGAGGC ACTGATAATA TGTTTGAGGT 1680 

166 TAAAATATAA ATTAAGACTT TAAAAGATGA AAAATGGTCC CTTCTTCCCT AATCAGCTCC 174 0 

167 CTTCCCCTGC CTGGTATGAG TTGCCCATCA TACGCATGGT CCTGGAGGAT GACCAGGACT 1800 

168 AATGCATGTG GTATGAGTAG GTTTGGCCCC CTCACTATCT CTAGAGTGAG AATCTGGCTC 1860 

169 CTGTTTCCAT GGGTCAAAGC CGGTTGCAGA GAATCTGTAG TCACTTTGGA GCTTTAGCTT 1920 

170 CTCTGCCAAG CCCTCAATAA GCCAGCAAAC CAGGACTCTG CCCCTTCTGT TTCCATAGGA 1980 

171 ATCATGTTGG ATAGTCAGCT GTACCAAGCC CCTTGGCCCT CTCCCATGCA CACAAACACC 204 0 

172 TCCTAGCAAG ACCTGTTGGT TAGCTGGACA TGCTTTGGCA ATTTTTTTAT ACTACCAAGT 2100 

173 GACCATATTG GCATGGCATT TTTTGGTGAT G 2131 
175 (2) INFORMATION FOR SEQ ID NO : 3: 

177 (i) SEQUENCE CHARACTERISTICS: 

178 (A) LENGTH: 464 amino acids 

179 (B) TYPE: amino acid 

180 (C) STRANDEDNESS : single 

181 ( D) TOPOLOGY: linear 

183 (vii) IMMEDIATE SOURCE: 

184 (A) LIBRARY: GenBank 

185 (B) CLONE: 164419 

187 (xi) SEQUENCE DESCRIPTION : SEQ ID NO : 3: 

189 Met Ala Leu Leu Thr Ala Ala Ala Arg Leu Phe Gly Ala Lys Asn Ala 

190 1 5 10 15 

191 Ser Cys Leu Val Leu Ala Ala Arg His Ala Ser Ala Ser Ser Thr Asn 

192 20 25 30 

193 Leu Lys Asp lie Leu Ala Asp Leu lie Pro Lys Glu Gin Ala Arg lie 

194 35 40 45 

195 Lys Thr Phe Arg Gin Gin His Gly Asn Thr Val Val Gly Gin lie Thr 

196 50 55 60 

197 Val Asp Met Met Tyr Gly Gly Met Arg Gly Met Lys Gly Leu Val Tyr 

198 65 70 75 80 

199 Glu Thr Ser Val Leu Asp Pro Asp Glu Gly lie Arg Phe Arg Gly Tyr 

200 85 90 95 

201 Ser lie Pro Glu Cys Gin Lys Met Leu Pro Lys Ala Lys Gly Gly Glu 

202 100 105 110 

203 Glu Pro Leu Pro Glu Gly Leu Phe Trp Leu Leu Val Thr Gly Gin lie 

204 115 120 125 

205 Pro Thr Glu Glu Gin Val Ser Trp Leu Ser Lys Glu Trp Ala Lys Arg 

206 130 135 140 

207 Ala Ala Leu Pro Ser His Val Val Thr Met Leu Asp Asn Phe Pro Thr 

208 145 150 155 160 

209 Asn Leu His Pro Met Ser Gin Leu Ser Ala Ala lie Thr Ala Leu Asn 

210 165 170 175 

211 Ser Glu Ser Asn Phe Ala Arg Ala Tyr Ala Glu Gly lie His Arg Thr 

212 180 185 190 

213 Lys Tyr Trp Glu Leu lie Tyr Glu Asp Cys Met Asp Leu lie Ala Lys 

214 195 200 205 

215 Leu Pro Cys Val Ala Ala Lys lie Tyr Arg Asn Leu Tyr Arg Glu Gly 

216 210 215 220 

217 Ser Ser lie Gly Ala lie Asp Ser Lys Leu Asp Trp Ser His Asn Phe 
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